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Abstract 

We bring to bear an empirical model of the distribution of twin primes and produce two distinct 
results. The first is that we can make a quantitative probabilistic prediction of the occurrence of gaps 
in the sequence of twins within the primes. The second is that the "high jumper" i.e., the separation 
with greatest likelihood (in terms of primes) is always expected to be zero. 
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1 Introduction 

In a recent paper 0] , hereafter referred to as " TPF we introduced a novel empirical characterisation of the 
sequence of twin primes. Notation, nomenclature and conventions developed in TPI are carried over here. 
There are three salient features of the model. First is the viewpoint that the sequence of twins is most 
naturally and usefully considered as a subset of the primes rather than in terms of the natural numbers. 
The second feature is that for the finite subset of primes less than some number TV, twins occur in the 
manner of fixed-probability random events. I.e., there is a constant probability that each successive prime 
following a given twin is itself the first member of the next twin. The third feature is that this "constant" 
probability is not universal. It varies in a rather simple way with the length of the sequence of primes up to 
N. 

In this paper, we investigate a number of aspects of and make predictions which follow directly and 
indirectly from this new model for the distribution of twins. In the next section, we shall briefly describe the 
details of our empirical model. The following section consists of two separate results. First are probabilistic 
predictions for the occurrences of large gaps in the sequence of twins. These predictions are verified by 
direct comparison in the regime in which the model was constructed. One important comment that must 
be made is that the gaps that we note are separations in the sequence of primes. Those usually quoted in 
the literature are arithmetic differences, or spacings in the natural numbers. The second result we report 
is that the so-called "high-jumper" (the most probable prime separation) is zero in our model. Our results 
do not admit a direct comparison with high-jumper analyses extant in the literature which are concerned 
with most probable natural number separations between twins less than a given N . 

2 The Model 

The model that we consider is empirical in that it is derived from a direct analysis of the distribution of 
twin primes less than 2 x 10^^. The essential feature which provides the key to the success of the model is 

^ patrick_kelly@ndsu.nodak.edu 



1 



that the distribution of twins is considered in the context of the primes alone rather than within the natural 
numbers. The model is based upon the observation that twins less than some number N seem to occur as 
fixed-probability random events in the sequence of primes. That is, there is a characteristic distribution of 
prime separations which may expressed in the form 

V{s,7Ti) = me-"'' . (1) 

Here, tti is the number of primes less than or equal to N, s is the prime separation (the number of unpaired 
singleton primes occuring between two twins), and m is a decay parameter which is constant for a given N, 
but varies with tti. P(s, tti) is the continuous probability density that a given pair of twins in the sequence 
of primes up to N has prime separation s. It is properly normalized: Jq V{s, tti) ds = 1. 

We chose a representative sample of prime sequences and determined the decay constants for each. We 
began our analysis with (5 7), discarding the anomalous twin (3 5). The variation of the decay parameters 
- the slopes on a plot of log(frequency) versus separation ~ is well-described by the following function: 

, , (1.321 ±.008) 
log (tti) 

We refer the interested readers to TP/ for detailed discussion of the derivation of this result and reinforcement 
of the claim that this form for m is consistent with the Prime Number Theorem, the Twin Prime Conjecture, 
and the Hardy-Littlewood Conjecture for twins. 



3 Two Results 

Armed with this model for the distribution of twins we now undertake some analyses of its predictions. 
3.1 Gaps 

Those who are actively enumerating twin primes are quite interested in the occurrences of large gaps ||^, ^, 
^, H . In our probabilistic model we can make estimates for the threshold number at which we might expect 
particular large prime separations to appear. The analysis below does just that. There is a caveat however: 
the gaps that we determine are prime separations, while the gaps referred to in the literature are gaps in the 
natural number sequence {i.e., arithmetic differences between an element of a twin and the corresponding 
element of its immediate predecessor) . 

To determine likelihood thresholds for large prime separations, we need to convolve the behaviour of m 
as a function of log(7ri) with its role in expressing the probability (or frequency) of separations V. Simply 
put, the probability that a particular large prime separation sl occurs is (we set ds — As — 1) 

X Too / mo SL \ 

V{SL, TTi) = - — -— exp -- — — - . (3) 
log(7ri) V log(7ri)y 

In this expression for the probability, we have merely taken (|l|) and inserted our empirical fit for m (|^) where, 
for convenience, we have written toq instead of the central value of the constant 1.321. The probability is 
the expected normalised frequency of each type of gap among the 7r2 twins (actually there are 7r2 — 1 gaps 
between the twins, but we drop the -1 in this analysis). 

While this is completely correct as it stands, and provides the probability that a prime separation of 
magnitude Sl occurs somewhere within the subset of first primes with length tti, this not quite what is 
wanted. Instead of looking for a gap of length sl, we should be considering any gap of length greater than 
or equal to s^. To accomplish this, we integrate V from sl to oo, and obtain the probability that a prime 
separation > sl occurs 
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Let us set a minimum threshold of / separation "events" out of the total number, 1:2 , where / is a risk 
factor aUowing one to be more cautious (/ > 1) or more daring (/ < 1). Fixing 



'P>{sl,tti) 



leads to a relation among the size of the gap, the prime and twin counts, and the risk factor: 

SL = — log(7ri) (log(7r2) - log(/)) . 
mo 



(5) 



(6) 



To cast this in a more useful form, we will utilise the well-known approximate formulae for tti and 1:2 quoted 
in TPI. 



N 



log{N) 

With these approximations, (|^) becomes 
1 



, n2iN) ~ 2c2 



N 



(log(iV))2 



SL 



Too 



\og{N) - log(log(7V)) \og{N) - 2 log(log(7V)) + log(2c2) - log(/) 



(7) 



(8) 



While this equation does not admit simple inversion to determine N{sl), it is readily graphed. We do 
this below up to prime separations of about 250 allowing us to compare the predictions of our probabilistic 
model with the data. 
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Figure 1: Prime gap vs. log(7V), for mo = 1.321, and 0.1 < / < 10. The points mark the onset of successive 
maxima in the spectrum of gaps. 



As one can see from the graph, the probabilistic model yields predictions which conform extremely well 
with the available data. The shape of the curves which bound our cautious-daring range is consistent with 
the Hardy-Littlewood Conjecture for twins in that the twins become more sparse among the primes as N 
increases. 
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3.2 High-Jumpers 

The high-jumper at any fixed N is the natural number separation which occurs most frequently between 
consecutive primes less than N. The high-jumpers are observed to increase with N Exactly the same 
analysis can be carried over to the case of twins. 

We refine the notion of a twin high-jumper to conform to our viewpoint that it is the distribution of 
twins among the primes themselves, not the natural numbers, which is significant. In other words our twin 
high-jumper is the most probable prime separation at any given N. With our probabilistic decay model for 
the occurrence of twins, it necessarily follows that the most likely high-jumper for twins is always zero. 

We note that this claim is crucially dependent upon our model for the slope —m tending to 0~ (or any 
finite negative value) rather than a positive value as N — > oo. Furthermore, we recognise that since the 
slowly-varying probability model for the occurrence of twins is stochastic, there may exist ranges of N for 
which the high-jumper is actually 1 or 2 or greater. 

4 Conclusion 

In this paper, we report on two results that follow straightforwardly from our novel characterisation of the 
sequence of twin primes as slowly- varying probability random events among the primes themselves. The 
first is that the model can be used to make probabilistic predictions for the onset of appearance of prime 
separations of a given (large) size. We have done this and the predictions agree very well in the regime 
where we have data. We believe that the predictions extrapolate and remain valid beyond the range of our 
data. The second result is that the so-called twin prime high-jumper, the mode prime separation, is always 
predicted to be zero by the very structure of the empirical model. 

5 Acknowledgements 

PFK and TP thank J. Calvo and J. Coykendall for helpful comments. This work was supported in part by 
the National Science Foundation (USA) under grant #OSR-9452892 and a Doctoral Dissertation Fellowship. 

References 



[1] P.F. Kelly and Terry Pilling, Characterization of the Distribution of Twin Primes, math. NT/0103191 



[2] Richard P. Brent, Irregularities in the Distribution of Primes and Twin Primes, Mathematics of Com- 
putation, 29,(1975) 43-56. 

[3] Richard P. Brent, The distribution of Small Gaps Between Successive Primes, Mathematics of Compu- 
tation, 28,(1974) 315-324. 

[4] Thomas R. Nicely, New maximal prime gaps and first occurrences, Mathematics of Computation, 
68:227, (1999) 1311-1315. 

[5] Marek Wolf, Some conjectures on the gaps between consecutive primes, preprint IFTUWr 894/95 
(1995). 

[6] A. Odlyzke, M. Rubenstein and M. Wolf, Jumping Champions, Experimental Mathematics, 8 (1999) 



107-118. Also available at ' http://www.research.att.com/~amo/doc/numbcrtheory.htm] 



4 



